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[57] ABSTRACT 

A system and method for the optimized storage and retrieval 
of video data at distributed sites calls for the deployment of 
"Smart Mirror" sites throughout a network, each of which 
maintains a copy of certain data managed by the system. 
Every user is assigned to a specific delivery site based on an 
analysis of network performance with respect to each of the 
available delivery sites. Generalized network performance 
data is collected and stored to facilitate the selection of 
additional delivery sites and to eosiue the preservation of 
improved performance in comparison to traditional net- 
works. 
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SYSTEM AND METHOD FOR OPTIMIZED 
STORAGE AND RETRIEVAL OF DATA ON A 
DISTRIBUTED COMPUTER NETWORK 

The invention is a continuation of application Ser. No. 
08/733^16, filed Oct. 18, 1996 now U.S. Pal. No. 6,003, 
030, and a continuation-in-part of Ser. No. 08/660^40, filed 
on Jun. 7, 1996, now U.S. Pat. No. 5,956,716 which is a 
continuation-in-part of Ser. No. 08/486,517, filed Jun. 7, 
1995. 

The invention relates to a system and method for dis- 
tributed data storage and retrieval, and more particularly, to 
a system and method whereby a user can acquire network 
performance information for a dynamic and distributed 
multipurpose network, and use this information to identify 
and select optimum delivery sites or servers from which to 
receive computer data, specifically multimedia content. 
Such delivery sites and servers are selected so as to increase 
network capacity, distribute server load, and reduce trans- 
mission delays between the server and the user. 

BACKGROUND OF THE INVENTION 

The Internet is a loose network of connected computers 
spread throughout the world. A message can be sent from 
any computer on the Internet to any other by specifying a 
destination address and passing the message from computer 
to computer via a series of "hops." Each computer, router, or 
"node" on the Internet has a unique Internet address. When 
an intermediate computer or router receives a message in 
transit, the computer checks the intended destination of the 
message and passes it along accordingly. 

The Internet is growing, in terms of both size and 
sophistication, at a rapid rate. In the past, most users of the 
Internet were academic, research, or institutional users; the 
Internet was primarily used at that time to transmit and 
receive electronic mail and network news and to allow 
transfer of computer files. However, since the introduction 
of the World Wide Web (also known as the "Web" or the 
"WWW") several years ago, the Internet has begun to host 
increasing amounts of other types of data of general interest, 
namely representations of images, articles, etc. 

The Web protocol and language establish a graphical 
means to navigate the expanses of the Internet. "Web pages," 
often consisting primarily of text and graphical material, are 
stored on nuimerous computers, known as "Web servers," 
throughout the Internet. A software program known as a 
"browser** can be used to access and view Web pages across 
the Internet by specifying the location (i.e. Internet address) 
of the desired Web page. When a Web page is accessed, its 
information is transmitted from the remote computer (server 
or delivery site), wherever in the world it may be located, 
across the Internet, to the user. 

In recent times, the Web has begun to host highly sophis- 
ticated types of multimedia content, such as audio and video 
data, and computer software. Compared to first generation 
Web content, namely text and still images, audio clips, video 
clips, and software programs have extremely high storage 
and bandwidth requirements. 

At present, it is difficult, if not impossible, to provide 
sustained high-speed transmission of large audio/video files 
over a multi-node link on the Internet. Because the data is 
often transferred from afar, many factors can cause the delay 
or even loss of parts or all of a transmission. It is generally 
not critical if a user experiences minor delays in receiving 
small graphic or text files. However, it is recognized that 
real- time data such as video has very specific and stringent 
timing requirements for data transfer and display. 
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Unformnately, the present design of traditional Interaet- 
like data networks is based on the principle that delays and 
significant data transmission rate variations are acceptable 
for ordinary data (e.g. text and still images). Consequently, 

S because of the high value of permitting access to text and 
graphical information firom locations around the world, such 
transmission defects are considered acceptable, and the base 
capacity of the Internet is somewhat "oversubscribed" to 
reduce data transmission costs. In other words, the timeli- 

10 ness of network data transmission has been significantly 
compromised in order to render relatively insignificant the 
aggregate cost of long distance communication connections. 

In order to successfully transfer audio-video data across a 
message -oriented network such as the Intemet, for any more 

15 than a few users, network resources should be committed in 
a manner facilitating timeliness of transmittal A system 
using committed network resources generally cannot take 
advantage of the existing pricing scheme of shared networks 
like the Intemet, since it cannot participate in the sharing of 

^ network resources on a data packet by data packet basis. 
Video data must be transmitted to the exclusion of lower- 
priority data. Transmission costs thus become significant, 
especially when the connection is "long distance" or when 
the connection is continued over an extended period of time. 

Another consequence of the timeliness vs. cost compro- 
mise discussed above has been the seemingly indiscriminate 
topographical design of the network. Since delays and 
throughput variations have traditionally been excused in 
favor of low cost, the configuration of the Internet infra- 
structure has also been driven by cost considerations. 
Accordingly, the interconnection efficiency of the network 
has rarely been considered. The rapid growth of real time 
data is changing this requirement. 

It is recognized that inadequate data transfer performance 
of time -sensitive data on the Internet is typically caused by 
four factors: packet loss, excessive server utilization, the 
relatively low capacity of the network infrastructure, and 
inherent delays in the network hardware. Packet loss, in 
particular, is caused by inadequate infrastructure and lack of 
robustness in routing. The inherent delays are believed to be 
caused by, among other things, the lack of flow control 
between adjacent nodes in a multiple -node path on the 
Internet. 

45 UnUke smaller text and graphic files, relatively large 
video files can take several minutes (or more) of 
"streaming," or constant data flow. Consequently, the usual 
network performance problems are exacerbated. Network 
bandwidth, or the data-carrying capacity of a particular 

50 network, is limited. Thus, packet loss and delays increase. 
Long delivery times consume a large amount of server 
capacity for a long time, decreasing the resources available 
to other users. Accordingly, because the network infrastruc- 
ture becomes increasingly congested, packet loss and delays 

55 continue to increase, transmission times rise, and server load 
increases further. 

This pattern exemplifies a "downward spiral" of network 
performance, which can be driven by the attempted trans- 
mission of large data files such as video clips. As long as 

60 network traffic remains within the hmits imposed by net- 
work bandwidth, network performance will remain accept- 
able. However, whenever peak network loads exceed 
capacity, the downward spiral described above will begin, 
causing increasing periods of poor network performance. 

65 As discussed above, a browser program can be used to 
access and view Web pages across the Internet by specifying 
the location (i.e. Internet address) of the desired Web page. 
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or more commonly, by "hotlinking" to Web pages. Commoo 
browsers are Lynx, NCSA Mosaic, Netscape Navigator, and 
Microsoft Internet Explorer. The desired Web page is speci- 
fied by a uniform resource locator ("URL"), indicating the 
precise location of the file using the syntax "http:// 
internet, address/directory/filename .html". 

Web pages arc generally described, in terras of layout and 
content, by way of a language known as "HTML" 
(HyperText Markup Language). Any particular computer 
linked to the Internet can store one or more Web pages, i.e. 
computer files in HTML format, for access by users. 

Hotlinking from one HTML Web page to another is 
accomplished as follows. The user first accesses a Web page 
having a known address, often on the computer located at 
the user's ISP (Internet Service Provider). The ISP is the 
organization providing Internet connectivity to the user. That 
Web page can contain, in addition to textual and visual data 
specified in HTML format, "links/* or embedded informa- 
tion (in the form of URLs) pointing to the Internet addresses 
of other Web pages, often on other computers throughout the 
Internet. The user, by selecting a link (often by pointing and 
clicking with a mouse), can then access other Web pages, 
which can in turn contain further data and/or additional 
Unks. 

Various extensions to HTML, such as Netscape's 
EMBED tag, allow references to other data to be embedded 
into Web pages. Some browsers are not capable of handhng 
data other than text and images. Other browsers can handle 
the data in various ways. NCSA Mosaic, for example, 
handles references to unlmown types of data by allowing the 
data to be downloaded to the user's computer, and then 
optionally invoking an external program to view or manipu- 
late the data. Recent releases of Netscape Navigator and 
Microsoft Internet Explorer take the concept one step fur- 
ther: a browser extension, or "plug-in," can b& automatically 
invoked to handle the data as it is received from the remote 
Web page. Other means, such as network program "applets" 
written in the Java language (or a similar language), can be 
used to extend the functionality of the browser environment 
or network. 

Digital multimedia data can have extremely high storage 
and bandwidth requirements. In particular, video files can be 
very large, from approximately 10 megabytes to 10 
gigabytes. In order to play video files at speeds approaching 
their recorded rate at a user's terminal, the files have to be 
delivered at a fast, constant speed. Too slow, and the image 
plays back slower than originally recorded. If the speed is 
uneven, then the video appears jerky, like an old-time movie. 

The network design compromises discussed above gen- 
erally adversely impact the transmission of audio and video 
data across the Internet. While a user using a browser to 
"surf' the Web might not notice minor delays and transmis- 
sion rate variations while retrieving text and still images, 
such defects become apparent and significant when real-time 
audio and video information is accessed. 

In an attempt to solve these problems, Internet content 
providers sometimes spread popular content around the 
Internet on various servers or deUvery sites known as 
"mirror sites." Each mirror site contains information that is 
essentially identical to that of the original site. For example, 
if a popular Web site is located in New York, mirror sites 
might be located in Los Angeles, London, and Tokyo. 
Accordingly, if a European user is having difficulty access- 
ing the original New York site, he can hotlink to the mirror 
site that is geographically closest, i.e. London. 

However, mirror sites have several disadvantages. For 
example, mirror sites may be widely distributed 
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geographically, but may not be efficiently distributed on the 
network in terms of actual usage, network traffic, etc. Thus, 
New York and Los Angeles mirror sites might both be 
connected to the same national Internet service provider's 
5 network, meaning that difficulty in accessing one of the sites 
might also affect the other. 

Furthermore, mirror sites might not be optimally placed to 
reduce load on each server. Although an "educated guess" 
might be made as to where a mirror site should be located, 
10 actual usage patterns might differ. Furthermore, there is no 
guarantee of enhanced performance. The bandwidth of the 
mirror site might be lower than that of the original site, or 
it might be overloaded for other reasons. 

Moreover, mirror sites arc often hosted on a voluntary 
basis. If a Web site is extremely popular, and a service 
provider determines that the subject matter might be of 
interest to its subscribers, that service provider might agree 
to host a mirror site of the original Web site. Such an 
arrangement would be attractive to host of the mirror site 
'^^ because people would be drawn to the mirror site, and might 
hotlink to other content hosted there. On the other hand, such 
voluntary alliances typically are not rehable and might be 
severed at any time. 

In essence, a mirror site offers a secondary source for data, 
^ which may or may not be available, and which may improve 
user convenience, but which does not address network 
bandwidth or efficiency. A mirror site does not account for 
performance characteristics of the network, nor identify 
available bandwidth which could be used to efficiently 
transmit video data while still taking advantage of the 
existing low-cost pricing schemes such as those on the 
Internet. 

Currently, there is no guidance in selecting optimal loca- 
tions for delivery sites, nor is there a known method per- 
mitting a user to determine which mirror site to connect to 
that will ensure optimtmi performance. In fact, the use of a 
traditional mirror site is voluntary. Typically, a user will try 
to access the original site (or a known mirror site), and will 
switch to another mirror site only if performance is found to 
be insufficient after one or more attempts. This approach is 
an inefficient utilization of network resources. Clearly, mir- 
ror sites are not an optimum solution to the problem of 
overloaded Web sites. A principal reason for tlais, among 
others, is the failure to consider network performance. 

Network analysis, particularly the performance of specific 
paths and hnks over the Internet, is well known and devel- 
oped. For example, the "ping" program allows a computer 
connected to the Internet to determine whether a remote host 

50 is accessible. However, the ping program uses a low-priority 
network protocol known as the I CMP protocol, and accord- 
ingly does not provide meaningful performance analysis 
information. The "traceroute" program follows the transmis- 
sion of a message from a computer to a remote host, tracking 

55 delays along each link, and determining the path taken by 
the message. The traceroute application can be used to map 
the flow of data. However, it lacks the ability to provide 
meaningful performance analysis information. Traceroute 
only provides route information for a message propagating 
in one direction, and only for one instant in time. 

Moreover, only the cormectivity characteristics of paths 
leading to and from the single computer nmning the tests are 
typically determined; expanding the scope of testing is 
possible but logistically impracticable, since the Internet is 

65 so large. 

Traditional network analysis techniques such as the 
"ping" and "traceroute" programs offer a view of network 
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connectivity but provide little understanding of what per- The delivery site cbosen by the configuration utility is 

formance can be expected from providers and mirror sites then used by that user for the retrieval of all content 

across the Internet. Therefore, only "guesses" can be made managed by the delivery system service provider 

as to where delivery or mirror sites should be located or Consequently, when the user is browsing Web content, and 

which mirror sites should be used to optimize performance. 5 finds a particular item, e.g. a video clip, that is managed by 

Accordingly, a need exists for a method of determining the service provider's delivery system, the client software 

overall network performance, A further need exists for a will automatically retrieve it from the specified "Smart 

system applying that method to enable content providers to Mirror" delivery site. Site preferences and default sites can 

dynamically locate data delivery or mirror sites at optimum be updated periodically on request, at specified times, or in 

network locations, and to allow users to select optimum lO response to changes in network load and traffic, 

mirror sites from which to receive data. Moreover, because the configuration utility of the inven- 

o Tx*\>r Dvr r^T7 TTJc TXT\/t:KmrMvr ^ performing various network tests and providing the 

SUMMARY OF THE INVENTION ^^^^ scTvicx^ provider, valuable data on system 

The invention is directed to a system and method for the and network performance is available. Such data provides 

optimized distribution of Web content to sites located around 15 information on which "Smart Mirror" delivery sites are 

the Internet. An intelligent mirroring scheme, called here performing effectively and which are not, which Smart 

"Smart Mirroring," is used to determine the need for and Mirror delivery sites are overbaded, and what portions of 

distribution of mirror sites and to direct user requests for the Internet might benefit from the addition of more delivery 

certain Web content to an optimum mirror site. sites or capacity. 

A number of "smart*' delivery or mirror sites are used to ^ Such data also makes it possible to perform such sophis- 

distribute popular Web content to various parts of the licated network analysis as end-to-end performance 

Internet. A comprehensive scheme of network analysis, measurements, workload characterization, route stability, 

based on tests performed by a large number of users, is used and outage metrics. Thus, from an engineering standpoint, 

to interactively determine the preferred locations for the the mirror service provider can continue to ensure that 

sites, and to determine the optimum sites to be used by each improved performance is being provided. From a marketing 

individual user. perspective, content providers can be told where to locate 

Accordingly, because each individual user is routed to a Smart Mirror or delivery sites for improved perfonnance. 

Smart Mirror or delivery site that provides improved and what ISP provides improved delivery, 
performance, overall network congestion is reduced. In most 

cases, the improved server is located electronically close to '° BRIEF DESCRIPTION OF THE DRAWINGS 

a user in order to decrease the number of network connec- pj^ ^ jg ^ ^^^^y. diagram of an illustrative network 

tions over which data must travel, thereby reducing packet topology of a system according to the invention, including 

loss and delay. multiple users and multiple content providers; 

Furthermore, network analysis results allow message traf- 35 pj^. ^ is a flowchart describing the operation of the 

fic to be routed away from those delivery sites and network configuration utility used in a system according to the 

regions that are already overloaded, and toward underuti- invention- and 

lized servers and networks. This results in an improvement J. » , , , • .l r r * 

in throughput as seen by each user, and wiU thereby increase "0. 3 is a flowchart describing the operation of a chent 

the appeal of the content offered by content providers using "^^^ "> » ''y^'^"' '° mvenUon. 

the system. Content providers are able to reach a larger DETAILED DESCRIPTION OF THE 

number of users across the Internet without suffering sig- INVENTION 
nificant decreases in performance. 

A system according to the invention begins with an The invention is described below, with reference to 

original Web site and at least one additional delivery (or 45 detailed illustrative embodiments. It will be apparent that the 

mirror) site. Each user desiring to use the system will be invention can be embodied in a wide variety of forms, some 

provided, in a preferred embodiment, with software which of which may be quite different from those of the disclosed 

includes a configuration utility and a cUent program. The embodiments. Consequently, the specific structural and 

configuration utility is used first to determine which delivery functional details disclosed herein are merely representative 

sites provide improved performance for that particular user. 50 anc^ do not limit the scope of the invention. 

In one embodiment of the invention, the configuration Referring initially to FIG. 1, the Internet 10, which is 

utility first downloads a "delivery site file" from a service intended to be representative of wide-area communications 

provider. This delivery site file contains a list of available networks in general, is depicted as a "cloud." The Internet is 

delivery sites and a list of network tests to be run. The types known to be an interconnected network of a large number of 

of tests and frequency of testing to be performed may be 55 computers. Although Internet-connected computers that are 

specified in the delivery site file, as dependent on the number "geographically" near each other can be "electronically" 

of users testing the network and the estimated drain 00 near each other on the Internet, such is not usually the case, 

network or delivery system capacity. However, one computer connected to the Internet can com- 

The configuration utility will run a subset of the tests municatc with any other computer connected to the Internet; 

specified in the delivery site file. The test results show which 60 the message wiU most likely travel over a path comprising 

delivery sites yield improved performance for the user, and a sequence of links, or "hops/* between computers that are 

also contain information on various generalized network directly connected to each other. 

capabiUties from the standpoint of the user running the tests. A first user tenminal 12 is also depicted in FIG. 1. The first 

The network test results and the identity of the chosen user terminal 12 is connected to an Internet service provider 

delivery site will be sent (via e-mail in one possible 65 (ISP) 14, which is typically just a computer, router, or 

configuration) back to the delivery service provider for terminal server connected to the Internet 10. An ISP 14 can 

incorporation into the service provider's database. host additional user terminals, such as a second user terminal 
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16. Other ISPs, sucb as a second ISP 18, are also connected lenninal 12, Although the configuration utility 34 and the 
to the Internet 10. A third tiser terminal 20 is shown client program 36 arc shown in FIG. 1 as a part of only the 
connected to the second ISP 18. Only three user terminals first user terminal 12, it should be recognized that any user 
arc shown; however, it should be recognized that the number terminal, such as terminals 16 and 20, participating in the 
of concurrent tisers of the invention is unlimited, subject to S system will use such software. A user desiring to participate 
the operational dctaib set forth below. in the system can obtain the software comprising the con- 
As is known in the art, content providers are also con- figuration utUity 34 and client program 36 directly from the 
nected to the Internet 10. A first content provider 22 might MSP 32, or through traditional retail or other channels (such 
provide a certain kind of content, for example sports scores as being part of the browser or operating system of the 
and highlights. A second content provider 24 might provide lo computer). It should be noted that the functions performed 
a different kind of content, for example business news. by the configuration utility 34 in the described embodiment 
TradiUonaUy, if a user (such as the one using the first user °^ invention can be integrated into general Internet 
terminal 12) wished to access the content provided by the appUcation software, such as a browser or other network 
first content provider 22, the terminal 12 would query the application; a stand-alone program is not necessary, 
first content provider 22 direcUy. A request message would a preferred embodiment, the configuration utility 34 
propagate fi*om the terminal 12, across the Internet 10, to the mxist be run by the user, either by command or automatically, 
content provider 22. The content provider 22 would send the before the user terminal 12 will have access to the system, 
desired data across the Internet 10 back to the terminal 12. xhe operation of the configuration utility 34 is shown in 
Several delivery, or "mirror*' sites are shown connected to detail in FIG. 2. 

the Internet 10 in FIG. 1. A first delivery site 26 might be ^ ™ - ^. . *i. 

, _, ,t 1. cut. » c £ * .1 The configuration ulihty 34, when first run on the user 

bcated a small number of hops from he first user tennmal ^^.^^^ 12, retrieves a delivery site file (step 40) from the 

12. A second delivery site 28 might be located further away 33 (pjg „ ^^^^ ^j^^^j ^ ^-^^ g,^ 

from the first user termmal 12, but close to the third user . . , ... ^. <: *• *i* ia\ j 

. , ^ . , , ,. . -ift . . t ... (e.g., it was received with the configuration utility 34), and 

terminal 20. A third dehvery site 30 might be as close to the ^jv •* ci • a- - n *u j r ci 
. . , • 1 -^A .u J J 1- •* ID - A 25 that delivery site file IS sufBciently new, the dehvery site file 

third user termmal 20 as the second delivery site 28 is. As , . • j r *u i 1 1. j j- t <r*u * - i 

, , , . can be retneved from the local hard disk of the user terminal 

previously noted, a user and a provider or dehvery site that i-i t-l- j i- ^^^ * • i* * n -i ui 

^ \, u .t- • .1. « 1 12. This delivery site file contains a hst of all available 

are ceoeraphically near each other might not be elec- ... •* / i_ j ^^ •* ir io jam j r * 

, T . . J ■ .L dehvery sites (such as dehvery sites 26, 28, and 30) and a list 

tronicall/ near each other on the Internet. By decreasing the - , . . . . ^ ^ in t« 

. „ ^. , , , . . J of network tests to be run at the user terminal 12. In the 

electromc^ distance between the user and the provider or . * r.u • «u u f , 

. , , r , 1 30 context of the mvention, there can be as few as two dehvery 

dehvery site, the number of network connections and routers . ^ „^ „ 

^ . . V , , . , sites, or if the number of users lustines it, as many as several 

over which data must travel can be decreased. j in. u c -* - ■ • i • iL. j 

thousand. The number or sites in pnncipal is unlimited, with 

As discussed above, the Smart Mirroring system acts to ^^^^ available delivery site represented in the delivery site 

improve network performance by decreasing the incidence g^^. 

of the foregoing network problems. Packet loss and delay 

problems are generaUy decreased by reducing the number of Th^ delivery site file is generated by the database from 

network connections over which data must travel, although within the MSP's computer system. The database applica- 

in some cases, the network testing procedure of the inven- "ses information about the user to dynamicaUy deter- 

Uon shows that some longer paths provide better throughput mine the optimum tests to run. Consequently, the delivery 

than some shorter paths. Very little packet loss, and essen- site file need not contain entries for every dehvery site in 

tially no delay, occurs in network cable; it typically is caused existence; the list can be tailored to include only those sites 

by overloaded network storage and routing devices. Because which appear appropriate or feasible, 

the Smart Mirror sites of the invention are located electroni- Initially, the magnimde of run-time variation in test con- 

cally near each user, packet losses and delays are reduced. figurations for the dehvery system users is low; that is, the 

The problem of excessive server utitization is reduced first group of users all run essentially the same tests. As the 

because multiple dehvery sites share the load Uiat typicaUy delivery service grows, however, the intensity of each user's 

would have been handled by a single server. The relatively testing is reduced in order to compensate for the increased 

low capacity of the network infrastructure becomes less of magnitude of testing network-wide. The scope of testing and 

a problem, because data retrieved from parallel delivery sites the number of delivery sites tested both can be narrowed to 

in different locations generally need not travel over the same further reduce the aggregate load of network testing, 

network links. ^ , . . . -t, . 

„ n J, ... ■ . J r In one embodiment, the dehvery site file will have a 

For the purposes of describmg this invention, a dehvery . ^ ti r n 

site is a "node" on the network which may store data or other generally as follows: 

files, such as software code, for delivery. The term can also 1, File Revision Number and Message. The file includes 

include a site which is responsible for data dehvery, includ- 55 this field to determine whether a new version of the 

ing mirror sites, content providers, and servers for broadcast configuration utility 34 is available. If the revision 

video streams or Web sites. number in the delivery site file is higher than the 

In the system, a mirror service provider (MSP) 32 is version number for the configuration utiUty, configu- 

connected to the Internet 10. The MSP 32, which exercises ration is not allowed. Instead, the user would be 

a management function over the distribution of delivery sites 60 prompted to acquire a newer version of the configura- 

26, 28, and 30, and over the allocation of requests to the tion utility 34. File revision verification as described 

original and dehvery sites from user terminals 12, 16, and herein ensures that the most up-to-date delivery site 

20, includes a database capable of transmitting and receiving selection algorithms are appUed to the test data gener- 

data over the Internet 10. ated by the configuration utility 34. 

This management function is faciUtated by the use of a 65 2. A list of available Smart Mirror dehvery sites. For each 

configuration utility 34 and a client program 36 run within available detivery site, the folio wmg information is 

a storage medium (i.e. random access memory) od the user provided: 
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a. Host name. In the known Internet format of 
"www.server.com." 

b. IP Address. A numerical Internet address in the 
known foraaat. The address is presently a 32-bit 
number of the fonn w.x.y.z,, where w, x, y, and z are 
each in the range of 0 to 255. 

c. Alternate Name, An informal name such as "The 
First Mirror Site." 

d. A list of tests to be executed. For each test, the 
following infomiation is provided: 

i. Test ID. Each type of test has a unique identifier 
known to the configuration utility 34. 

ii. Weighting factor. Each test will be weighted by a 
specified percentage. 

iii. Frequency. Each test is not necessarily run every 
time. This field specifics a probability, determin- 
ing how often a particular test will be run. 

iv. Additional Inforaaation (optional). For certain 
tests, additional information may be needed. 

e. Site Preference Level. Each site can be given a 
weighting, or preference level, between, for 
example, 1 and 100. As discussed below, aggregate 
data in the MSP's database is used to perform 
network usage analysis not possible with only the 
single user's instantaneous end to end testing. The 
weighting factor provided here is used to incorporate 
the lest results received from the service provider's 
database. This weighting factor is also used to limit 
assignment of new users to a delivery site once a 
predetermined maximum usage level has been 
reached, 

f. Test Site Flag. If this flag is enabled, the foregoing 
tests will be run, but the site will not be assigned as 
a delivery site even if it yields the best performance. 

g. Content Provider Groups. Each site can belong to 
one or more content provider groups, thereby mir- 
roring only certain content. If a user is not interested 
in the types of data hosted by a particular delivery 
site, then it does not need to be tested. 

The configuration utility 34 then queries the user (step 42) 
for various items of information needed in the configuration 
process, for example, the user's name, e-mail address, 
password, modem speed, and information related to access 
control (e,g. what levels of various attributes are viewable 
by the user). The access control mechanism will be dis- 
cussed in further detail below. In one embodiment of the 
invention, the information received from the user is 
encrypted and stored in a configuration file on the user 
terminal 12. 

' The configuration utility 34 then detennines whether the 
user terminal 12 is connected to the Internet (step 42). If not, 
it will initiate a connection (step 44) or prompt the user to 
do so. 

A series of network tests is then performed (step 46). One 
or more tests can be performed for each available site listed 
in the delivery site file; not all sites in the file need to be 
tested. 

The following test types are presently considered to 
provide useful data: 

1. Ping. Provides information on whether a remote server 
is reachable, and if so, how long it takes for a low- 
priority message to travel round trip from the user 
terminal 12 to the remote server and back. Ping is a 
simple test usefiil in deciding whether a site is available 
for further evaluation. Excessive times returned by the 
ping application can be used to eliminate delivery 
systems which are far too "slow" for effective infor- 
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mation deUvery. This test is used by the terminal to 
reduce the number of deUvery sites tested, 

2. Traceroule. Provides information on what route is taken 
by a message £rom the user terminal 12 to a remote 
server, including what systems are used along the way, 
and how long each hop takes. Traceroute is tised by the 
configuration program 34 to document the path of 
information transmission. Several traces with differing 
results might indicate that the stability of the route from 
a particular user to a specific server is not acceptable. 
Previously aggregated data on particular routes, from 
the service provider's system database, may also influ- 
ence the decision to choose a particular delivery site for 
a ^ecific user. Route stability is the primary consid- 
eration. 

3. Reverse Traceroute. Provides information on what 
route is taken by a message from a remote server lo the 
user terminal, including what systems are used along 
the way, and how long each hop takes. Reverse Trac- 
eroute is used by the configuration program to docu- 
ment the path of information receipt. Several traces 
with differing results might indicate that the stability of 
the route from a particular server to a specific user is 
not acceptable. Previously aggregated data on particu- 
lar routes, from the service provider's system database, 
may also influence the decision to choose a particular 
delivery site for a specific user. Again, route stability is 
the primary consideration. 

4. Dynamic Traceroute. Similar to traceroute or reverse 
traceroute, but between any specified pair of computers 
on the Internet. Dynamic Traceroute is used by the 
configuration program to document a path of informa- 
tion transmission. Several traces with differing results 
might indicate that the stability of the route between 
two network locations is not acceptable. Previously 
aggregated data on particular routes, from the service 
provider's system database, may also influence the 
decision to choose a particular deUvery site for a 
specific user. As above, route stabflity is the primary 
consideration. 

5. Name Server Resolution Delay. If the numeric Internet 
address is unspecified, a name server lookup is per- 
formed to determine what numeric address corresponds 
to the desired host name. This process can take a 
substantial amount of time. 

6. Throughput. A sample file is downloaded, or partiaUy 
downloaded, from the remote server to determine the 
actual throughput in bytes per second. 

7. Throughput variation. A sample file is downloaded, or 
partially downloaded, from the remote server to deter- 
mine if the throughput is relatively constant or fluctu- 
ating. 

8. Error rate. A sample file is downloaded, or partially 
downloaded, from the remote server to determine if the 
transmission is subject to transmission errors. This 
information is obtained by counting the number of error 
message packets returned to the user, thereby learning 
of the error rate from the user to the server, and by 
tracking the number of bytes received as a fraction of 
the number of bytes transferred, thereby learning of the 
error rate from the server to the user. 

9. Packet Fragmentation. A sample file is downloaded, or 
partially downloaded, from the remote server to deter- 
mine it the transmission is subject to firagmentation or 
out-of-order packet reception. 

10. Capacity Query. If the remote server is so enabled, the 
configuration utility 34 queries the server to determine 



05/24/2004, EAST Version: 1.4.1 



6,154,744 



11 



12 



10 



15 



20 



25 



its iransmission capacity and its average load. This 
information is collected via the Simple Network Man- 
agement Protocol ("SNMF'), which is supported by 
nearly all Internet servers. 
U. Macroscopic Network Analysis. The data accumti- 
latcd by the MSP database offers a global view of 
network behavior. This information permits the Smart 
Mirror system user to have a historical view of the 
performance of the available delivery sites. The accu- 
mulated data is manipulated by the delivery system 
database to analyze network performance, in order to 
emphasize usage in high-capacity areas of the network, 
while deemphasizing usage in areas already experienc- 
ing diminished performance. 
Information on how each of the foregoing tests are 
performed is well known in the art of network analysis. See, 
e.g.. Bob Quinn and Dave Shute, Windows Sockets Network 
Programming (Addison-Wesley 1996). In one embodiment 
of the invention, testing is accomplished by performing a 
"ping" test to verify whether a server is reachable, a series 
of small (e.g. 20K) downloads, a series of large (e.g. 200IQ 
downloads, and "traceroute" and "reverse traceroute" tests 
to document delivery paths. 

The traceroute information is used by the MSP 32 to 
correlate test data to information in its database; in that way, 
particularly bad network links and servers can be identified. 
Such information is provided in the delivery site file dis- 
cussed above; if a particular link or server is known to be 
unreliable, based on information obtained from other tisers, 
an individual user can be routed away from it, even if a 30 
single test gives good results. 

The short downloads are used to determine server capac- 
ity. The name server resolution delay can be determined by 
such a test, as well as the time it takes a server to begin 
sending data. The later result is strongly related to server 
load, capacity, and performance. 

The long downloads allow the configuration utility 34 to 
determine how packet loss, network congestion, and server 
utilization affect file delivery. It is not ideal to determine 
which of the foregoing factors is causing decreased perfor- 
mance based on test results from a single user However, 
such results in the aggregate, as stored in the database 
maintained by the MSP 32, indicate the root causes. 

It should be noted that some of the test results may be used 
in conjunction with other test results. For example, the load 
on a delivery site as determined through a capacity query can 
be divided by the results of a throughput test to derive an 
average expected download time, given the load character- 
istics of the server. 

After all specified tests are run, the results are collected 
and processed (step 48). It is possible that certain tests were 
not able to be successfully performed; in such cases, the 
results should indicate an appropriate worst-case value (e.g. 
zero throughput or extremely high delay). 

It is important to note that not all possible tests will be 
performed each time the configuration utility 34 is run. 
When a large number of users is using the system, a 
substantial drain on server and network capacity would be 
caused by the testing procedure alone, increasing the down- 
ward spiral of network performance previously discussed. 

As noted above, a test frequency number is stored in the 
delivery site file for the purpose of dynamically controlling 
the number of users perforaaing a test. The testing performed 
by the configuration utility 34 is performed to achieve 
statistical confidence in deciding which delivery site is best 
suited for data delivery to a particular user. Statistical 
confidence is obtained by either testing a small sample of 
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users sufficiently well and using that data to influence the 
choice of a dehvery site, or by having a large number of 
users each "lightly" test several available sites and using that 
data in the aggregate. 

Accordingly, when the system is used initially, a relatively 
small number of users arc "enrolled" in the system. The 
delivery site file maintained by the MSP 32 reflects those 
conditions, and requires each user to test the network 
(through the configuration utility 34) relatively heavily. As 
the number of users increases, the delivery site file is 
modified to decrease the tests performed by each user. By 
the time a very large number of users are using the system, 
the configuration utility 34 may predominantly test for 
delivery site reachability (via a "ping^*-type test, as discussed 
above), and rely primarily on test data provided by other 
users and stored in the database maintained by the MSP 32. 
However, even when many users are testing the system, a 
small number of users (e.g. one in 5,000) may be selected to 
run a comprehensive set of tests. 

Preferably, testing should not contribute more than 
approximately 5% of total server load. One way to reach this 
goal is to lightly test a large number of servers, yielding a 
group of delivery sites having adequate performance. This 
group of delivery sites can then be used in rotation to 
retrieve data. Information on multimedia clip actual down- 
load times for each of the dehvery sites in the group is 
accumulated as discussed below, and further information on 
delivery site performance can then be furnished to the MSP 
32 transparently, without the need for further outright test- 
ing. 

Accordingly, on the basis of the collected test results, and 
on information provided in the dehvery site file by the MSP 
32, the configuration utifity 34 determines which delivery 
site, or group of delivery sites, is best for the user terminal 
12 (step 50). This determination can be made numerically by 
weighting the various tests performed and comparing the 
score for each site. 

In a presently preferred embodiment, for use in a low- 
trafiBc environment with a relatively small nximber of deliv- 
ery sites and users, the configuration utiUty 34 relies prima- 
rily on ping and throughput tests for each available delivery 
site. An initial ping test is performed to determine if a 
delivery site is reachable. Short and long downloads are 
performed in the throughput tests to determine the maximum 
and minimum throughputs from the dehvery site, and to 
determine whether throughput variation is small enough to 
accommodate the transmission of video data. Accordingly, 
those tests are all given high weights. Other tests, such as 
Iraceroutc, can be performed, and the results reported to the 
MSP 32, without playing a role in the choice of delivery sites 
(such other tests can be given weights of zero, or nearly zero, 
for example). As the size of the system increases, and 
additional dehvery sites and users are enrolled, the site 
selection formula can be altered, by changing the delivery 
site file contents, to reflect changing network conditions. 

In one embodiment of the invention, a proprietary graphi- 
cal interface is provided so that the location of the tiser and 
the locations (both geographic and electronic) of each site 
tested can be displayed on a monitor connected to the user 
terminal 12, allowing a visual indication of the relative 
distances between sites. In one embodiment, the display is 
shown in the form of a "radar screen," upon which the user 
terminal 12 and delivery sites are displayed as "bUps" 
superimposed over a map of the pertinent geographical 
region. In order to encourage the user to use the appKcation 
and to offer more network-wide data, the user interface can 
allow the user to enter an "ad-hoc" test site for additional 
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performance testing. In this case, the configiiration utility 
will test either the default Web page file (e.g., "index.html") 
or a specific file requested by the user. Analysis results from 
the \iser-sclected site arc adjusted so that reasonable com- 
parisons can be made with res\ilts from other sites. 

It should be noted that multiple sets of delivery sites can 
be maintained by the invention, to accommodate several 
groups of content providers. Each content provider might be 
mirrored only at certain sites. Accordingly, for each content 
provider having a unique set of delivery sites, a primary 
delivery site is selected by the configuration utility 34. To 
accomplish this, the foregoing tests can be run once, and 
then, in one embodiment, a numerical weighting can be 
applied to each appropriate set of delivery sites. A pltirality 
of Smart Mirror sites is selected, one for each content 
provider group. The content provider groups is specified in 
the delivery site file; each possible delivery site is identified 
as belonging to one or more content provider groups. When 
content provider groups are used, there can be as few as two 
groups; the maximum number is essentially unlimited. 

It should also be noted that a prioritized ranking of 
delivery sites can also be generated and maintained. If this 
is done, failure of the primary Smart Mirror site to respond 
will allow the system to fall back to the next-highest ranked 
Smart Mirror site. 

After a Smart Mirror site is selected, certain data will be 
sent to the MSP 32 (step 52) via e-mail or other Internet 
electronic protocol. The information received by querying 
the user, the identity of the selected Smart Mirror site, and 
all raw test data and results, including the time and date at 
which each test was run, is compiled into a text file (which 
is encrypted in one embodiment). Upon receipt by the MSP 
32, the data is stored in a database for use in managing and 
analyzing the system. 

Finally, the configuration utility 34 save the identity of the 
selected Smart Mirror site for each set of delivery sites, or 
the prioritized list, to the (encrypted) configuration file (step 
54). The configuration utility may also save information on 
relative performance for each tested delivery site. The client 
program 36 uses the encrypted configuration file to down- 
load data files (video chps or other content) from the 
appropriate Smart Mirror site. 

It should be noted that in the operation of the system, the 
MSP 32 performs certain functions. The MSP 32 maintains 
the delivery site list, adding and deleting sites as necessary. 
The MSP 32 also maintains the database of network 
performance, containing information received via e-mail or 
other means from users nmning the configuration utility 34. 
As large amounts of data are received from numerous users, 
the database can provide valuable information on the per- 
formance and other characteristics of the Internet and por- 
tions thereof. Various data processing techniques are known 
to derive such information. 

The locations of the delivery sites used with the invention 
are ultimately determined by a number of factors, including 
marketing considerations and cost/benefit analyses. 
However, the data stored in the MSP's database can confirm 
the utility of placing a delivery site at a given location on the 
Internet or other network. In one embodiment, servers are 
located on each major backbone (portion of the Internet 
maintained by a single corporation) and on other Internet 
fines serving large numbers of users, such as the major lines 
operated by the Regional Bell Operating Companies 
("RBOCs'*). In certain networks serving large ntimbers of 
users or having heavy video delivery traffic, servers can be 
placed at major Points of Presence ("POPs") for the network 
to ensure that eadi user has excellent to a fast server. 
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Once the configuration utility 34 has been run, the user 
can use the system to enable and facilitate the receipt of data 
files, specifically video clips, audio clips, software 
programs, and other content. 

As time passes and the usage patterns of a xiscr's region 
of the Internet change, the user might become dissatisfied 
with the performance of the Smart Mirror site associated 
with his user terminal 12. If that happens, the user is free to 
re-run the configuration utility 34. By that time, additional 
delivery sites might have been placed into service, or a 
different pre-existing site might perform better than the one 
previoxisly assigned. Furthermore, if the player program 36 
determines that the selected Smart Mirror site is not per- 
forming adequately (e.g., it has failed three times out of ten 
attempts), the player program 36 can prompt the user to 
re -run the configuration utility 34. In other embodiment of 
the invention the testing and mirror assignment is run 
automatically with each request for a file on the Smart 
Mirror service or at some intermittent times such as after 
every other, every third, every tenth or every one hundredth 
request. 

In one embodiment of the invention, the Smart Mirror 
system is used to locate a delivery site from which to 
download a video or audio cHp ("clip") referenced on a Web 
page. In this embodiment, the client program can be referred 
to or considered a "player program." The player program, in 
addition to carrying out the functions of the client program 
36, enables the retrieval and playback of video data. 
Ordinarily, a browser program 38 is run on the user terminal 
12 to view Web content. Browser programs typically used 
include NCSA Mosaic, Netscape Navigator, and Microsoft 
Internet Explorer. The browser program 38 allows the user 
to hotlink among various Web sites on the Internet. 

The EMBED tag is used within HTML documents to 
indicate which Web pages include content managed by the 
system. When the browser program 38 receives a Web page 
containing an EMBED tag, a download of the file referenced 
by the tag is commenced, and the file type is analyzed. If the 
file is of a type handled by the player program 36, e.g. 
MPEG, the browser program 38 initiates the player program 
36. The contents of the tag are then passed by the browser 
program 38 to the player program 36. 

The player program 36 (FIG. 1) provides the Smart 
Mirroring services facifitated by the MSP 32. The operation 
of the player program 36 is shown in detail in FIG. 3. 

The player program first analyzes the EMBED tag to 
determine if there is an "SM" (Smart Mirror) parameter 
(step 60); the presence of such a parameter indicates that the 
embedded clip is enabled for Smart Mirroring. Data asso- 
ciated with the "SM" parameter specifies the particular 
content provider from which the desired clip originated, as 
well as the group of mirror servers that particular content 
provider uses. 

If the player program 36 determines that the EMBED tag 
references a video clip or other content handled by the 
55 system (step 62), the transfer of the embedded clip from the 
content provider 22 is stopped. The player program 36 then 
extracts access control or rating information from the 
EMBED statement (step 64), if any exists. This rating 
information is compared against the reference levels stored 
in the configuration file stored at the user terminal 12 (step 
66). If rating information does not exist for the clip, the 
configuration file is queried to determine whether unrated 
clips, as defined below, may be played (step 68). Based on 
the foregoing information, the player program 36 will autho- 
rize or decline the viewing of the desired clip. 

If playback is authorized, the player program 36 attempts 
to find the referenced cUp on the local computer belonging 
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to the user terminal 12 (Step 70). If it exists there, it is not error rate of the transfer. At some interval (e.g. weekly or 

re-downloaded, and can be played directly on the computer once every 100 downloads), a message containing the accu- 

(from the disk or from RAM) (step 72). However, the time mulated file transfer performance information, as well as the 

and date of creation of the clip on the local computer is first user and server information discussed above, would be sent 

verified against the time and date for the clip available on the S (automatically or upon request) to the MSP 32 (step 106) to 

network, to determine if the stored clip is the most recent update the MSP's database. This additional information 

version (step 74). If it is not, the stored clip is discarded (step increases the MSP's 'Icnowledge" of network performance 

76) and the download proceeds as follows. without incurring any additional testing overhead. 

If the clip does not exist on the local computer, the player This data is especially valuable in ascertaining the per- 

crcates a new URL (step 78) in the following form: "http://", 10 formance of delivery sites, for the purpose of assessing the 

plus the IP address of the selected Smart Mirror site stored quahty of service purchased from the deUvery site provider, 

in the configuration file, plus the path name to mirror files and for documenting the quality of service to content 

(e.g. "/pub/mirror/*'), plus the name of the content provider providers, to support the cost of the system. It is recognized, 

taken from the "SM" parameter in the EMBED statement, however, that much of the same information can be obtained 

plus the filename taken from the EMBED statement The 15 through new users running the configuration utility 34. 

constructed URL is used to retrieve the selected clip from Furthermore, the provision of download information to 

the appropriate Smart Mirror site selected by the configu- the MSP facilitates the use of the invention as a premium 

ration utility 34 (step 80). If more than one set of delivery subscription-based service. As successful downloads are 

sites exists for different content providers, the "SM" param- tracked in a database, each user can have an associated 

eter is further used by the player program 36 to determine 20 "account" to track charges. The user can be charged for use 

which Smart Mirror site in the configuration file is to be used of the Smart Mirror system by the file, by the megabyte, by 

in the constructed URL (step 82). the month, or by other known means. In one embodiment, 

If the clip corresponding to the constructed URL is not the EMBED tag associated with a file contains billing 
found at the Smart Mirror site, or is unable to be accessed, information, or a "price" for the file. The invention's track- 
then the download proceeds from the next-highest ranked 25 ing of download performance allows discounts or credits to 
Smart Mirror site in the configuration file (step 84). If all be issued if downloads are found to be unduly difficult or 
delivery sites fail, the download proceeds from the original slow. 

content provider's site as specified directly by the EMBED To ensure that files stored on Smart Mirror delivery sites 

statement. are used only by authorized users of the invention (e.g. those 

If playback is disallowed, the player prevents the clip 30 users having paid accounts), the files stored at the delivery 

from being transferred (step 88) and displays a bitmap (step sites can optionally be in encrypted form, and the down- 

90) advising the user that the download is not be permitted. loading step described above can include a decryption step. 

If the player program 36 determines that the EMBED tag Such encryption and decryption can be performed by well 

references a video clip or other content not handled by the known means. 

system, the player will check whether the access control 35 As discussed above, the clips managed by the invention 

level set in the configuration file allows the user to play these can have content rating information associated therewith, 

chps or other files which are considered "unrated" (step 92), This is accomplished by providing a "PG" parameter in the 

If so, the clip is transferred fi-om its original content provider EMBED statement corresponding to the clip. In one 

22 by traditional means (step 94), and the player program 36 embodiment, four characteristics are rated: nudity, sexuality, 

displays the downloaded file (step 96). If not, the player 40 profanity, and violence. Accordingly, the PG parameter can 

prevents the clip from being transferred (step 98) and be specified by a four-digit argument. Each characteristic is 

displays a message (step 100) advising the user that the rated on a scale of one to three. One corresponds to no 

download is not permitted. filtering (i.e. all content is allowable), two corresponds to 

Upon download, the data file representing the desired clip some filtering (e.g. equal to levels typically allowed in 

is stored within a specified data area on the local computer, 45 broadcast television), and three corresponds to the most 

usually on the hard disk, belonging to the user terminal 12 extensive filtering (e.g. for children). The ratings levels 

(step 102). In one embodiment, this data area can be contained in the EMBED statement for a file are compared 

managed on a least-recently-used basis by the player pro- to the ratings filter levels contained in the configuration file 

gram 36. That is, if no room in the data area remains for a stored at the user terminal 12 in the foregoing authorization 

new clip, the least-recently-used (or viewed) clip or clips can 50 process, and only authorized files are transferred, 

be discarded to make room (step 104). ' In view of the above, it will be appreciated that embodi- 

In one embodiment of the invention, the client program 36 ments of the invention may be employed in many different 

is capable of sending messages to the MSP 32 (step 106) to applications to permit the acquisition and analysis of per- 

reflect whether downloads were successfid. This message formance data for networks between a given user and 

contains the Internet address of the user terminal 12, the 55 content provider or delivery site. Thus, although the 

identity of the selected server set, the Internet address of the described embodiment illustrates the system operating 

site used to accomplish the download, the Internet addresses within the context of the Internet, it is recognized that such 

of all sites which failed, the name of the file downloaded, a system could prove to be useful in other network 

and the time to download the file. This information can also environments, such as corporate "intranets." 

be used by the MSP 32 to track file dovmloads and to 60 Moreover, aUhough the illustrative embodiments are 

determine, in real time, whether there are any problems with described primarily for use in a video delivery system, it 

any Smart Mirror sites. should be recognized that a system according to the inven- 

Alteraatively, the chent program 36 can maintain a smaU tion can be used to distribute various other kinds of corn- 
local database of file transfer performance. Each download puter data (e.g. application programs, database files and 
would then be timed. Specifically, information can be gath- 65 other business information, virtual reality files, multimedia 
ered on the time it takes a server to begin sending the such as Macromedia Shockwave files, and large text files 
requested file, the stability of the data transfer rate, and the such as books) as well. Such other types of data can be 



05/24/2004, EAST Version: 1.4.1 



6,154 

17 

managed by ibe invention in different content provider 
groups as discussed in detail above; a different type of 
program (rather than the player program 36) typically would 
be invoked at the user terminal 12 to view or use other types 
of data. S 

It should also be noted that certain functionality described 
as performed at the user terminal 12 (specifically, certain 
functions performed by the configuration utility 34, or 
client/player program 36) can be implemented as a standa- 
lone program, as a "plug-in" or "helper application" to run 
within a browser program, or as a Java applet downloaded 
from a delivery site to run within a browser environment. 
For user temainals capable of running the Microsoft Win- 
dows operating system, an environment known as Microsoft 
ActiveX is also useful. 

While certain exemplary structiu'es and operations have 
been described, the invention is not so limited, and its scope 
is to be determined according to the claims set forth below. 

What is claimed is: 

1. A system for selecting a preferred delivery site from ^ 
which to receive a file, the preferred deUvery site being one 

of a plurality of delivery sites located on a distributed 
network, at least one of the delivery sites storing at least one 
copy of the file, comprising: 

at least one content provider connected to the network, the 

content provider storing at least one file; 
a user terminal connected to the network; and 
a network traffic testing apparatus connected to the user 
terminal for selecting the preferred delivery site from 
the plurality of delivery sites, 30 
wherein the file is downloaded from the preferred delivery 
site to the user terminal. 

2. The system as recited in claim 1, wherein the file is a 
graphics file. 

3. The system as recited in claim 1, wherein the file is a 35 
text file. 

4. The system as recited in claim 1, wherein the file is a 
video file. 

5. The system as recited in claim 1, wherein the file is an 
audio file. 4o 

6. The system as recited in claim 1, wherein the network 
testing apparatus comprises a first software program for 
testing the network and selecting the preferred delivery site 
from all delivery sites connected to the network. 

7. The system as recited in claim 6, further comprising a 45 
downloading apparatus connected to the user terminal, the 
downloading apparatus including a second software pro- 
gram capable of downloading the file from the preferred 
delivery site to the user terminal. 

8. The system as recited in claim 7, further comprising 50 
one or more domain name service (DNS) servers connected 

to the network, wherein the DNS servers translate the 
domain names of the delivery sites to their Interact Protocol 
(IP) addresses. 

9. The system as recited in claim 8, wherein the DNS 55 
servers are connected in a hierarchy. 

10. The system as recited in claim 1, further comprising 
a database connected to the network, said database having a 
list of available delivery sites. 

U. The system as recited in claim 10, wherein the 60 
network testing apparatus analyzes the network and selects 
the preferred delivery site from the list of delivery sites 
maintained by the database. 

12. The system as recited in claim U, wherein the 
network testing apparatus conducts one or more Ping tests to 65 
determine the round trip travel time between the tiser 
terminal and one or more delivery sites. 
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13. The system as recited in claim 11, wherein the 
network testing apparatus conducts one or more Trace-route 
tests. 

14. The system as recited in claim 11, wherein the 
network testing apparatus conducts one or more Reverse 
Trace -route tests. 

15. The system as recited in claim 11, wherein the 
network testing apparatus conducts one or more Dynamic 
Trace-route tests, 

16. The system as recited in claim U, wherein the 
network testing apparatus conducts one or more Throughput 
tests. 

17. Hie system as recited in claim 11, wherein the 
network testing apparatus conducts one or more Throughput 
variation tests. 

18. The system as recited in claim 11, wherein the 
network testing apparatus calculates the Error Rates between 
one or more delivery sites and the user terminal. 

19. The system as recited in claim 11, wherein the 
network testing apparatus conducts one or more Packet 
Fragmentation tests to determine if transmission from a 
delivery site is subject to fragmentation. 

20. The system as recited in claim 11, wherein the 
network testing apparatus calculates the transmission capac- 
ity of one or more delivery sites. 

21. The system as recited in claim 11, wherein the 
network testing apparatus calculates the average load of one 
or more delivery sites. 

22. The system as recited in claim 11, wherein the 
network testing apparatus creates a prioritized list of pre- 
ferred delivery sites based on the network analysis, 

23. The system as recited in claim 22, further comprising: 
means for routing traffic towards underutilized regions of 

the network based on the network analysis results, 
thereby decreasing the time to download the file. 

24. The system as recited in claim 22, wherein the nesting 
testing apparatus comprises a configuration utility file that 
downloads a delivery site file, the delivery site file including 
a list of available delivery sites and a list of network tests to 
be run. 

25. The system as recited in claim 11, further comprising 
a graphical interface displaying the location of the user 
terminal and the locations of each delivery site tested. 

26. The system as recited in claim 25, wherein the 
graphical interface displays the geographic and electronic 
distance of the delivery sites from the user terminal. 

27. The system as recited in claim 26, wherein the user 
terminal and the delivery sites are displayed on a monitor as 
"blips'* superimposed over a map of the pertinent geographi- 
cal region. 

28. The system as recited in claim 27, further comprising; 
means for encrypting the files prior to storing in the 

delivery sites; and 
means for decrypting the files when they are downloaded 
from the delivery site. 

29. A method for selecting a preferred delivery site from 
which to receive a file, the preferred delivery site being one 
of a plurality of delivery sites located on a distributed 
network, at least one of the delivery sites storing at least one 
copy of the file, the method comprising the steps of: 

connecting at least one content provider to the network, 
said content provider storing at least one file; 

connecting a user terminal to the network; 

testing the network using a network traffic testing appa- 
ratus connected to the user terminal; 

selecting the preferred delivery site from the plurality of 
delivery sites based on the results of the test; and 

downloading the file firom the preferred delivery site to 
the user terminal 
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30. The method as recited in claim 29, wherein the file is 
a graphics file. 

31. The method as recited in claim 29, wherein the file is 
a video file. 

32. The method as recited in claim 29, wherein the file is s 
an audio file. 

33. The method as recited in claim 29, wherein the file is 
a text file. 

34. The method as recited in claim 29, wherein the 
network testing apparatus comprises a first software pro- lO 
gram for testing the network and selecting the preferred 
delivery site firom all delivery sites connected to the net- 
work. 

35. The method as recited in claim 34, further comprising 
the step of connecting a downloading apparatus to the user 15 
terminal, the downloading apparatus including a second 
software program capable of downloading the file from the 
preferred delivery site to the user lemainal. 

36. The method as recited in claim 35, further comprising 
the steps of connecting one or more domain name service 20 
(DNS) servers to the network; and translating the domain 
names of the delivery sites to their respective Internet 
Protocol (IP) addresses. 

37. The method as recited in claim 36, wherein the DNS 
servers are connected in a hierarchy. 25 

38. The method as recited in claim 29, further comprising 
the step of connecting a database to the network, wherein the 
database includes a list of available delivery sites. 

39. The method as recited in claim 38, further comprising 
the step of selecting the preferred delivery site from the list 30 
of delivery sites maintained by the database. 

40. The method as recited in claim 39, further comprising 
the steps of: 

analyzing the network using the network testing appara- 
tus; and ^5 

selecting the preferred delivery site from the list of 
delivery sites maintained by the database based on the 
results of the network analysis. 

41. The method as recited in claim 40, further comprising 
the step of creating a prioritized list of preferred delivery ^° 
sites based on the network analysis. 

42. The method as recited in claim 41, further comprising 
the step of routing traffic towards underutilized regions of 
the network based on the network analysis results, thereby 
decreasing the time to download the file. 

43. The method as recited in claim 42, further comprising 
the step of downloading a delivery site file using a configu- 
ration utility file, the delivery site file including a list of 
available delivery sites and a list of network tests to be run. 

44. The method as recited in claim 43, further comprising 
the step of displaying the location of the user terminal and 
the locations of each delivery site by a graphical interface. 

45. The method as recited in claim 44, further comprising 
the step of displaying the geographic and electronic distance 
of the delivery sites from the user terminal by the graphical 
interface. 
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46. The method as recited in claim 45 further comprising 
the step of displaying the user terminal and the delivery sites 
as "bhps" superimposed over a map of the pertinent geo- 
graphical region. 

47. The method as recited in claim 46, further comprising 
the steps of: 

selecting a subset of delivery sites from the list; and 
running at least one test for each of the dehvcry sites in 
the subset. 

48. The method as recited in claim 47, wherein the test is 
selected from a plurality of tests. 

49. The method as recited in claim 48, further comprising 
the step of weighting the results of the network analysis test. 

50. The method as recited in claim 49, wherein the 
weighting steps utiHze weights specified in the list of 
delivery sites. 

51. A method for downloading a file from a delivery site, 
comprising the steps of: 

determining whether the file is mirrored on one or more 
delivery sites; 

if the file is mirrored, identifying at least one preferred 
delivery site according to the results of at least one 
network traffic test; and 

downloading the file from the preferred delivery site. 

52. The method as recited in claim 51, further comprising 
the step of repeating the identifying and downloading steps 
until downloading is successful 

53. The method as recited in claim 52, further comprising 
the step of downloading the file from a content provider if 
the repeated dovraloading steps are unsuccessful. 

54. The method as recited in claim 51, wherein the file is 
a graphics file. 

55. The method as recited in claim 51, wherein the file is 
a text file. 

56. The method as recited in claim 51, wherein the file is 
a video file, 

57. The method as recited in claim 51, wherein the file is 
an audio file. 

58. A method for locating and maintaining a network of 
delivery sites for the optimized storage and retrieval of files, 
comprising the steps of: 

receiving network traffic test results from a plurality of 
users; 

storing the test results in a database; 
processing the test results; 

deteraaining network performance at various locations on 

the network; and 
distributing the delivery sites and the graphics files 

throughout the network according to the determined 

network performance. 

♦ * ♦ ♦ ♦ 
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